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Abstract: Teleparallel theories of gravity have a long history. They include a special case 
referred to as the Teleparallel Equivalent of General Relativity (TEGR, aka GRy). Recently 
this theory has been generalized to /(T) gravity. Tight constraints from observations 
suggest that f(T) gravity is not as robust as initially hoped. This might hint at hitherto 
undiscovered problems at the theoretical level. In this work, we point out that a generic 
f{T) theory can be expected to have certain problems including superluminal propagating 
modes, the presence of which can be revealed by using the characteristic equations that 
govern the dynamics in f{T) gravity and/or the Hamiltonian structure of the theory via 
Dirac constraint analysis. We use several examples from simpler gauge field theories to 
explain how such superluminal modes could arise. We also point out problems with the 
Cauchy development of a constant time hypersurface in FLRW spacetime in f{T) gravity. 
The time evolution from a FLRW (and as a special case, Minkowski spacetime) initial 
condition is not unique. 
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1 Introduction 

Einstein proposed the idea of teleparallelism, or Fernparallelismus (aka absolute paral- 
lelism, distant parallelism), in order to unify gravity and electromagnetism [1]. Unlike 
general relativity, spacetime in teleparallelism is equipped with a connection with zero 
curvature, but non-vanishing torsion. Since the spacetime is flat, the parallel transport 
of a vector is independent of path, and hence the name te/eparallel, meaning "parallel at 
a distance". Einstein's quest for unification via teleparallel theory can be found in the 
interesting account of [2]. Despite Einstein's failure to formulate a unified theory, telepar- 
allelism was later revived and studied as a pure theory of gravity. The main motivations 
were (1) that the teleparallel formulation was considered to have advantages with regard 
to the identification of the energy-momentum of gravitating systems, and (2) teleparallel 
theory can be regarded as a gauge theory of local translations (see, e.g., [3-11]).^ In fact 
Einstein's general theory of relativity (GR) can be rewritten in teleparallel language (see, 
e.g., [13-16]). The result is a theory equivalent to GR known variously as the Teleparallel 
Equivalent of General Relativity (TEGR) or GRy. Recently TEGR has been generalized to 

""^For recent study of teleparallelism as a higher gauge theory, see [12]. 
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/(T) theory, a theory of modified gravity formed in the same spirit as generahzing general 
relativity to f{R) gravity [17, 18]. 

It seems however that observational data puts rather tight constraints on /(T) gravity, 
effectively reducing the theory (at least for some choices of f{T)) back to TEGR (However, 
see also [19] and [20]). This includes constraints from considering a varying fine structure 
constant [21], the formation of large-scale structures [22] and a varying gravitational con- 
stant [23], as well as solar system constraints [24]. This could hint at problems with f(T) 
gravity at the theoretical level. 

In this paper we will briefiy review TEGR and more general theories of teleparallel 
gravity, including f{T) gravity, in section 2. In section 3, we discuss the degrees of freedom 
for f{T) theory. We know that there are extra degrees of freedom in the theory beyond those 
of GR; that the number of extra degrees of freedom is generically 3 was clearly revealed by 
the Dirac constraint analysis carried out by Li et al. [25]. Despite this, in a careful linear 
perturbative analysis up to second order on flat Priendmann-Lemaitre-Robertson- Walker 
(FLRW) background, no extra mode appears [26]. In the case of Minkowski background, 
this had already been pointed out in [25]. Disappearance of extra degrees of freedom is 
known to happen in some gravity theories (e.g., in massive gravity, see [27]); this can be 
caused by the accidental disappearance of kinetic terms at the linear level. If so, as we 
shall argue, it is likely to be a signal of superluminal propagation. We will also discuss 
the related danger of the nonlinear constraint effect, which plagued some other theories of 
gravity in the past. Our conclusion is that /(T) theory is very likely to give rise to unwanted 
problems due to the effects of nonlinear constraints, in addition to propagation outside of 
the null cone. As a consequence, the number and type of degrees of freedom could vary 
as a physical system evolves. This is further explored via the method of characteristics in 
section 4. 

The method of characteristics is properly introduced in section 4, where we also provide 
an illuminating example in an application to the nonlinear Proca field. The same example 
is given a Hamiltonian analysis that clearly shows the close connection of superluminal 
propagating modes and nonlinear constraints. Although the characteristic equations allow 
one to obtain superluminal propagating degrees of freedom, one must be careful to distin- 
guish between physical degrees of freedom and gauge degrees of freedom, as the example 
of the Maxwell field illustrates. 

In section 5, we investigate the dynamical equations in f{T) theory and derive a sys- 
tem of partial differential equations that describe the characteristics of the theory. This 
calculation reveals nonlinear features that are very prone to allow superluminal propaga- 
tion outside of the metric null cone.^ Furthermore, we also show that the constant-time 
hypersurface in FLRW spacetime, and as a special case Minkowski spacetime, can be a 
Cauchy horizon, which means that even the infinitesimal future time evolution from it is 
not unique. We conclude with some discussion in section 6. 

^We should remark here that superluminal propagation may or may not be achronal, so it is not certain 
that such propagation really does lead to a violation of causality. We make further comments on this in 
section 4. 
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2 Teleparallel Theories of Gravitation 



Let M be a semi-Riemannian manifold of signature (— , +, +, +) with metric g. The telepar- 
ahel theories we wish to discuss'^ use the vierbein field or tetrad ea{ x^) as the fundamental 
dynamical variables. We use Greek letters for spacetime coordinate indices and Latin 
letters to label the tetrad vectors. 

Suppose that the manifold is parallelizable, i.e., there exist n vector fields {vi, ...,Vn} 
such that at any point p £ M the tangent vectors Vi\p^s provide a basis of the tangent 
space at p. Note that this particular set of vector fields must be able to span the tangent 
spaces at all points of the manifold, i.e., it is a set of everywhere linearly independent 
globally non-vanishing vector fields. Then we can view the mapping between the bases of 
the tangent space in a coordinate frame {d^} to that of a non-coordinate frame {e^} as 
an isomorphism TM — )• M x M^. Orthonormality of the tetrad is thus imposed by the 
Minkowski metric on the tangent space TpM = M'^'^ = (R^, ry), related to the metric on the 
manifold M by g^^ = r/,(,e"^e^^. 

One then equips the manifold M with the metric-compatible Weitzenbock connec- 

w 

tion [30, 31] V defined by 

w 

VxY := {XY'')ea, (2.1) 
where Y = V^Ca- Equivalently, the connection coefficient is 

In local coordinates, the components of the torsion tensor are 

T\u ■■= - = e,\d^e\ - d,e%) + 0. (2.3) 

The Weitzenbock connection can be shown to be curvature-free.^ 

The Weitzenbock connection differs from the Levi-Civita connection by the contortion 
tensor^ 

W 1 / «) w w \ 

Kt'% = --( T'^'p - T^f'p - Tpf"" J . (2.4) 
The Einstein-Hilbert action of general relativity is (in units where c = 1) 

S = ^ f ct^x V^R. K = 8ttG. (2.5) 



2k „ 

Up to a total divergence (which does not affect the field equations) this action is equivalent 

S = -— / d^x \e\T, (2.6) 
2k ,1 



^There are more general teleparallel theories which also have nonmetricity [7]; in fact there are sym- 
metric teleparallel theories — with vanishing torsion but non-vanishing nonmetricity — including one that is 
equivalent to GR [28, 29]. 

^There is an alternative formulation for teleparallel theories that has some advantages; it uses the 
teleparallel connection as an a priori independent dynamical variable that is constrained to have vanishing 
curvature via a Lagrange multiplier [4, 5, 7, 33-35]. Here, to more simply communicate our ideas to the 
audience we most want to reach, we will use the formulation familiar in the f{T) works. 

^We remark that the proper term is contortion instead of the often used "contorsion" . See e.g., [32]. 
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where e = det (e"^), 

UI 1 / U) w w \ 

■■= 2 + ^pT^^a - S^T^^aj , (2.7) 

w 

and T is a certain quadratic-in-torsion scalar: 

www 

T := Sp^^TV, (2.8) 

the latter is the so-called torsion scalar. 

In this teleparallel language, (2.6) is called the TEGR action. f{T) gravity simply 
promotes the "torsion scalar" in the TEGR Lagrangian to a function, i.e. the action of 
/(T) gravity is^ 

S = jd^x |e|/(T). (2.9) 

One can then obtain the field equation for f{T) theory of modified gravity by varying 
with respect to the frame e"'^, this leads to 

X/ := d.iefrSr) " ee^^T^, V'^/t - ^ee// = 0. (2.10) 

We remark that the (quadratic-in-)torsion scalar defined above is a very special one. 
We note that mathematically, we can define a more general torsion "scalar" T by relaxing 
the coefficients:^ 

^ W W WW WW 

T = aT^pTp'^^' + + cTp/T^^',. (2.11) 

In particular one can construct the one-parameter combination 

1 _|_ 9 X WW 1 9\ WW WW 

T\\] = ^^T"^,^ + ——TP^,,T^% - Tp/T'-f',, A > 0. (2.12) 

Years ago a teleparallel theory using this as the Lagrangian was proposed as an alternative 
to GR. This theory passed all observational (as of, perhaps up to, 1990) [37, 38] and some 
theoretical tests [39]. But then certain theoretical problems were found, which we will 
discuss in the next section. 

We summarize how teleparallel theories work: We start with choosing a good local or- 
thonormal frame {ca} which is declared to be covariantly constant under parallel transport 
by imposing the Weitzenbock connection. This defines a global frame field that sometimes 
goes by the name orthoteleparallel frame or OT frame. Recall that there are infinitely 
many possible choices of frames that span the same tangent space at p S M, related to 
each other by elements of the proper orthochronous Lorentz group. The torsion tensor and 
consequently the torsion scalar, can thus be expressed in terms of any one of these infinitely 
many bases. Once we get a parallelization, it is defined up to global Lorentz transformation. 

®For simplicity here we follow the somewhat non-standard convention used in [26], in which the factor 
-(2k)"^ is absorbed into /, i.e., TEGR corresponds to /(T) = -(2K)"^r. 

^In fact one could also include two more quadratic-in-torsion combinations which have odd parity; they 
can be found in Baekler and Hehl [36]. 
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This defines a Weitzenbock geometry. The set of all possible parallelizations of M is then 
partitioned into equivalence classes consisting of parallelizations that relate to each other 
in the same class by a global Lorentz transformation (thus, one cannot pass from one paral- 
lelizations to another by a local Lorentz transformation). Note that at this stage, nonzero 
torsion does not mean that we have some "distinguished frames" or preferred frames; any 
equivalence class of frames are on the same footing, in the sense that they parallelized the 
spacetime equally well. However, once we introduce the physics via the action, we get field 
equations. Now we can only admit certain classes of parallelizations, just as in general 
relativity where we can only admit metrics that satisfy the Einstein Field Equations. 

We remark that due to the lack of local Lorentz invariance (See also the discussions 
in [40] and [41]), unlike in general relativity where we can change coordinate systems and 
use any frame fields, this is not so in generic teleparallel theories. As a consequence, this 
implies that we cannot directly extract the tetrad from the metric in the straightforward 
way. TEGR is special since its action does not determine the admissible OT frame but 
only the metric. From now onwards, we will suppress all the explicit overscript w^s on the 
torsion scalar and connection coefficients etc. 

3 Extra Degrees of Freedom and the Danger of Nonlinear Constraints 

We are reminded of the problem faced by the one-parameter teleparallel theory in which 
the teleparallel Lagrangian has the form (2.12). This theory has 4 extra degrees of free- 
dom [42]. Kopczyhski [33] first pointed out that this theory has predictability problems; 
he argued that one is unable to determine uniquely the evolution of the teleparallel ge- 
ometry. However, further analysis [43] showed that the problem is not generic, instead 
it only occurs for a special class of solutions. Cheng et al. verified this behavior using 
the Hamiltonian approach and identified the troubles as a certain effect of nonlinear con- 
straints, there referred to as constraint bifurcation [42], in which the chain of constraints 
could "bifurcate" depending on the values of the fields. That is to say, the number or type 
of constraints would depend on the values of the phase space variables. One could imagine 
that the evolution of some system as depicted by a curve in phase space could pass through 
regions with different number or type of constraints. This would be rather strange (since 
this means that the number of gauges and physical degrees of freedom also change), and 
one will find it hard, if at all possible, to predict the evolution of the system. To put it 
in another way, the matrix with the Poisson brackets of the constraints as entries may 
not have constant rank, due to the nonlinearity of the constraints. The number or type 
of constraints may change as one approaches a point in the phase space where the rank 
changes. The more general Poincare gauge theory also has similar problems [44-46]. We 
suspect that this might also be the case for /(T) theory (and probably for many teleparallel 
theories). It is of course conceivable that with a specific choice of /(T), we could prevent 
the solutions from ever approaching such problematic regions in phase space, e.g., if they 
would act dynamical repeller. 

The number of degrees of freedom of a generic /(T) theory of gravity (in (3 + 1)- 
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dimensions) has been found to be five by Miao Li et al. [25] using a Hamiltonian approach.^ 
That is, there are three extra degrees of freedom, which the authors suggest could corre- 
spond to one massive vector field or one massless vector field with one scalar field. To 
understand the properties of the degrees of freedom in a theory, one should first try to 
analyze them by a perturbative approach. However, it could happen that not all degrees 
of freedom show up in the perturbation level. 

For the case of f{T) gravity, as remarked by Li et al. at the end of their paper [25], for 
some special backgrounds (including Minkowski space with e'^^ = (5'*^), some of the extra 
degrees of freedom do not appear in the linear perturbation. It is also found that [26], on 
the flat FLRW background with a scalar field, linear perturbation up to second order does 
not reveal any extra degree of freedom. In other words, f{T) theory is highly nonlinear: 
the number and type of constraints in the linear theory is different from that of the full, 
nonlinear theory. This is also one of the problematic features discussed in the context 
of Poincare Gauge Theory [44-46]. The worry of such behavior is that linearized modes 
which are "good" may cease to be so in the full nonlinear theory, perhaps accompanied by 
anomalous characteristics, as we will further comment on in the later sections. 

The revealing Hamiltonian discussion of Li et al. for /(T) gravity in (3+1) dimen- 
sional spacetime is rather complicated and some of the results are only inferred implicitly. 
However they also worked out explicitly the analogous but somewhat simpler Hamiltonian 
formulation for f(T) gravity in (2 + l)-dimensions; for this case the problem we wish to 
address is clearly evident. In this case there are six first class constraints^ {H, Hi,Il°'^), 
where o = 0, 1, 2 and i = 1,2, as well as four second class constraints (L-^, F^, F-"^^, vr), which 
leads to two degrees of freedom. 

They define the quantities 

yi = {Ho,r}, 2/3 = {i/o,ri'}, 

xo = {Ho, n}, Xi = {r,TT} , X3 = {r, vr} , (3.1) 

and 

Ai = {T\r^^} (3.2) 

« 2e [9°^ {g'^'g'"^ - g^'g'"^) + g'\g'"^g'' - g'^g'') - g'\g'"^g'' - g'^g'')] dm4>. 

(3.3) 

The self-consistency equation in matrix form is shown to be M^oK^£) = 0, where K^d = 
(1, Al, A2, As, A)^ and 

/ yi 2/2 2/3 xo\ 

-yi Al xi 
M3D = -y2 A2 X2 (3.4) 

-ys -Al -A2 3:3 
\-a;o -xi -X2 -x-s 0/ 



His analysis was based on the Hamiltonian formulation of Maluf [47, 48]. Additional insight could 
probably be found using the other more general Hamiltonian approaches to gravity theories which specialize 
to teleparallel theory [5, 34, 49, 50]. 

®We do not explain notations of Li et al. [25] here since we will not need them in detail. 
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satisfies det M^d = 0. This matrix generically is of rank 4. However, as previously men- 
tioned, some of the Poisson brackets could vanish in some cases, which would result in rank 
changes of the matrix as the system evolves. Miao Li et al. found the explicit formulas for 
the Lagrange multipliers: 

Al = ^2a;o + X3y2 - X2yz ,^ 

A1X2 - A2X1 
^ -Aixo - X3yi + xiys 

(3.6) 



A1X2 - A2X1 
yiX2 - y2Xi 
A1X2 - A2X1 ' 



(3.7) 



X = pl^^. (3.8) 
A1X2 - A2X1 

While they were interested in the most general case and thus only considered the case with 
A1X2 — A2X1 / 0, this type of special case is precisely what we are interested in in this 
work. Indeed one sees that in principle ^1X2 — A2X1 can vanish, yet the numerators of 
these expressions are not generally constrained to vanish at the same time, which allows 
one or more of the Lagrange multipliers to become unbounded. As we will explain below, 
this is signaling a superluminal propagation mode. 



4 The Method of Characteristics 

A viable theory of gravity should satisfy certain theoretical criteria, including the lack 
of tachyonic modes, or modes that carry negative kinetic energy (i.e., ghosts). The 
theory should also support a well-posed initial value problem, that is, satisfy the Cauchy- 
Kowalevski theorem [51-53]. Moreover any propagation mode in the theory should also 
be described by hyperbolic quasi-linear partial differential equations with well-behaved 
characteristics, i.e., the characteristic surfaces should be non-spacelike. One should also 
consider a theory with "good" Minkowski limit as preferable to a theory that does not have 
such behavior. 

The method for studying characteristics is well-known; see for example the detailed 
classical work of Courant and Hilbert [54], or Chapter 8 of the more recent text of [55]. A 
discussion of the method of characteristics in the case of general relativity can be found in 
Lecture 14 of [56]. For more rigorous treatment see [57], as well as in [58]. See also [59] 
for some useful discussions in the context of external fields in gauge theories. Here we only 
give a brief summary. Recall that if P is a linear differential operator of order k, then we 
can consider P as a polynomial in the derivative D. In multi- index notation we can write 
this as 

P = 5^ aa(^)I?". (4.1) 

|a|<fc 

We may ask in what directions it is really of order k. For an ODE ak{x){d/dx)^+...+aQ{x) = 
for example, it is obvious that ak{x) 7^ is the condition required for the equation 

^"However, one may still argue that theories with such seemingly pathological features may still be 
acceptable. We will make further comments in section 6. 
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to be of order k everywhere. For PDEs with multiple variables, we need to be careful. 
Consider for example the Laplacian d'^/dx'^ + d^/dy'^ in M?. This is clearly of order 2 
in both X- and y-direction, however for an operator which is mixed, e.g., d'^/dxdy, it is 
not of order 2 in either x- or y-direction. Nevertheless, it is a second-order operator, 
which can be revealed by introducing new variables, t = x + y,s = x — y, which renders 
d'^/dxdy = {l/A){d'^/dt^ — d'^ /ds^). That is, this operator is of order 2 in the s and t 
directions. In general then, given any operator P at x, and a direction v, we can make an 
orthogonal change of variable so that v points along one of the new coordinate axes, say 
the xi-axis. Now, if the coefficient of the partial derivative {d/dxi)^ (in the new coordinate 
system) is nonzero at x, then we can say that P is of order at rr in the direction v. We 
refer to such situation as noncharacteristic. That is, characteristic refers to the case in 
which the coefficient of {d/dxi)^ vanishes at x. 

The total symbol or top-order symbol of P is simply a polynomial obtained by replacing 
the derivative D with a variable, say ^. The principal symbol, denoted o"p(^), is the highest 
degree component of the total symbol. That is. 



The principal symbol almost completely determines the qualitative behavior of the solutions 
of the system. Furthermore, it is well known that for hyperbolic (as well as parabolic) 
partial differential equations, the zeros of the principal symbol describe the characteristics 
of the system. That is to say, the characteristic directions are exactly those for which the 
principal symbol vanishes. Indeed the "standard procedure" is to begin with the equation 
of motion or the field equation, keep only the highest derivative terms, and then replace 
the said derivatives d^dy ■ ■ ■ with components of the normal vector to the characteristics 
k^ky ■ ■ ■ k^ and set the equation to zero. This is called the characteristic equation. We can 
study how the characteristics propagate by looking for what sorts of vectors are allowed as 
solutions to the characteristic equation. 

Note that for a characteristic equation that is a matrix equation, called the character- 
istic matrix, it suffices to consider the vanishing of the determinant of the characteristic 
equation (known as the characteristic determinant) instead of the vanishing of the equation 
itself (the latter is of course a stronger statement). 

In this paper, we are interested in one (or a few) special characteristic direction that 
gives the signal of a superluminal mode. By analyzing the characteristic determinant, we 
can examine all directions at once. 

The characteristic surface which is orthogonal to the characteristic direction coincides 
with the edge of Cauchy development, i.e., the Cauchy horizon. This is because the higher 
order derivative term with respect to the characteristic direction disappears. The disap- 
pearance of the higher order derivative term causes the evolution to be singular. Thus, the 
Cauchy development of the characteristic surface is only the characteristic surface itself 
and the evolution from it is meaningless even if it is spacelike. 

Note that for application in theories of gravity, we would like our characteristic direc- 
tions to be null or spacelike, as a timelike characteristic direction is the signal of superlu- 




(4.2) 



a\=k 
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minal propagation (and could violate causality, although this is not necessarily so even in 
theories with local Lorentz invariance [60-62]). Correspondingly, there should not be any 
non-trivial solution that corresponds to any timelike vector. 

4.1 Example: Nonlinear Proca Field 

The characteristic method is best illustrated via an example or two. 

Consider the Lagrangian of the Proca field with a nonlinear term [63] : 

C = - V^'^F^, - ^m^Af^A^ - ^XiAf'A^f, (4.3) 

where F^i, = d^A^, — dyA^ and m, A = const. The signature is (—,+,+,+). The field 
equations are 

d^.F^" - m^A"" - XA'^Af.A'' = 0. (4.4) 

Upon taking the divergence of this equation, because the first term vanishes by the anti- 
symmetry of F^'^ , we obtain an implicit constraint satisfied by the system: 

(m^ + XA''A^)dyA'' + IXA^'A'^d^A^ = 0. (4.5) 

For A = 0, this gives di^A'^ = 0, which upon substituting back into the field equation (4.4) 
gives 

{df.d'' - m^)A'' = 0, (4.6) 

which is the Klein-Gordon equation. The characteristic equation for this case is just k/j^k^ = 
0, i.e., the characteristic is null. 

To find the characteristics in the general A case, replace the highest derivative terms 
d'^ A by k^ A. Here means it is not the value of the vector but represents the 
change of the vector in a certain direction. 

We then get the relations 

k-kA" -k''k-A = 0, 
(m^ + XA-A)k-A + 2XA ■kA-A = 0. (4.7) 

A linear combination of these gives 

[(m^ + XA-A)k-k + 2X{A ■ kf] k ■ A = 0. (4.8) 

Now from eq. (4.7) one can see that modes with k ■ A = propagate with null character- 
istics, but from eq. (4.8) one can see that modes with k ■ A ^ generally have non-null 
characteristics. The normal to the characteristic surface could be timelike, indicating a 
superluminal characteristic. In that case there is a Lorentz frame in which k^ = (1, 0, 0, 0). 
In such a frame the condition is 

+ X{-3Al + AiA') = 0. (4.9) 
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We want to emphasize that this tachyonic mode of the nonhnear Proca field can be 
detected using the Hamiltonian formulation, as we now explain. From the Lagrangian, the 
canonical momenta are 

dC 

vr^ = TTT- (4-10) 

That is, explicitly, vr* = Fq and vr*^ = 0; the latter is a primary constraint. 

The Hamiltonian density, constructed according to the Dirac-Bergmann constraint 
procedure [64-67], is [44] 

i^Vi + \F'^Fij - Ao^cTt'^ + \m\A'Ai - Al) + \\{A'Ai - Alf + U7:\ (4.11) 

where the primary constraint vr*^ ~ 0^^ has been included with an unknown Lagrange 
multiplier u. Prom the Hamiltonian evolution equation 

Aq{x) = {Aq{x),H] = u{x), (4.12) 

one finds the meaning of the multiplier, it is the missing "velocity" . Preserving the primary 
constraint leads to the secondary constraint: 

X := dcT^^ + m'^Ao + \{A' Ai - AI)Aq w 0. (4.13) 

The Poisson bracket of the two constraints is 

{7,\x),x{y)] = K + KA'A, - 2,Al)\ 5\x - y). (4.14) 

Generically this is non-vanishing, so they make up a 2nd class pair. 

However, there is an important exception when the RHS vanishes, this is exactly the 
same as the anomalous characteristic condition (4.9). The dynamical consequence shows 
up when we require preservation of the x constraint, which is the relation that determines 
the "unknown multiplier" u: 

~ X{x) = {x{x),H] = u{x)[m^ + X{A'Ai - ?>Al)] + G, (4.15) 

where Q denotes a collection of field dependent terms which, generically, are non- vanishing. 
Consequently, the field "velocity" Aq, given by the Lagrange multiplier, it, becomes un- 
bounded at any point (s) where A := -|- X{A^Ai — 3Aq) approaches 0. That is, 

Q 

lim = lim uix) = lim — — = oo. (4.16) 

This is an indication that, with respect to this constant time spacelike hypersurface, there 
is instantaneous propagation of the Aq mode. The nonlinear constraint has led to a field- 
dependent constraint Poisson bracket value, a signal for the occurrence of superluminal 
propagation, in complete agreement with the previous analysis obtained from the charac- 
teristic equation. 



Here denotes Dirac's weak equality, i.e., it only holds on the constraint surface within the phase 
space. 
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4.2 Example: Scalar Field and Maxwell Field 

One issue that we have to be careful with is the possible presence of gauge degrees of 
freedom in a theory (which was not present in the previous nonlinear Proca field example) . 

In order to appreciate the issue, first we shall look at the simple example of scalar 
fields. We consider the Lagrangian for the single scalar field defined by 

= -^5^</'i5^'/'i. (4.17) 

We know that the scalar field (pi propagates in the null direction. In terms of characteristics, 
we can understand it as follows. The equation of motion is 

d^df'cj)! = 0. (4.18) 

So the characteristic equation for (pi is k^^k^ = 0, which means the characteristic is null. 

Let us introduce another scalar field (p2 which does not appear in the Lagrangian. 
Trivially, the Lagrangian does not change under the transformation (1)2 ^ 4'2 — 4'2-^ fix^)- 
This property is a certain kind of gauge transformation. The equation of motion which 
comes from the variation of the Lagrangian (4.17) with respect to (p2 is trivial, i.e., = 0. 
Thus the characteristic equation for (pi and (p2 becomes k^k^^ x = 0, which is trivially 
satisfied for any k^. The triviality comes from the gauge mode (p2- Since (p2 does not 
appear in the Lagrangian, we should not include it in the discussion of characteristics. 

The situation for any gauge field theory is similar. For simplicity, we consider the 
Maxwell theory of the electromagnetic field; the Lagrangian is 

Cem = --^F^uF^'", (4.19) 

where F^^, = d^j_Ay — dyA^. The equation of motion is obtained to be 

d^F^"" = 0. (4.20) 

The principal symbol yields the characteristic equation 

k^k^'A'' - k^k^A^" = 0, (4.21) 

where we want to seek some nontrivial A^. If /c^ is null then /c^A^ = gives a propagating 
mode on the null cone. However, if /c^ is not null, then the characteristic equation can 
be satisfied by any A^ = Ck^ for arbitrary constant C . In particular k^ can be chosen 
to be timelike, which means that the characteristic direction will become timelike. This 
superluminal mode however, as we shall see, actually corresponds to gauge mode and so it 
is not physical. 

Alternatively we can consider the determinant method. If we write down the character- 
istic equation without removing the gauge degree of freedom, we will obtain characteristic 
matrix that satisfies 

det [k^k^'g^p - k^kp] = 0. (4.22) 
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The left-hand side of this equation is for any k^, and so this equation becomes trivial. 
The origin of the triviality is the gauge degree of freedom. If we operate di, to the left- 
hand of eq. (4.20), it becomes algebraically 0. Therefore, eq. (4.20) has the information 
of only three independent equations. We know the Maxwell field has C/(l)-gauge, i.e., the 
Lagrangian is invariant under the C/(l)-gauge transformation Af^ — )• = Af^ + 5^A. Since 
the gauge degree of freedom A does not appear in the Lagrangian, the variation of the 
Lagrangian with respect to the gauge degree of freedom A must be trivial, which is the 
origin of the above trivial equation. According to the above simple example of scalar fields, 
we should eliminate the contribution from gauge degrees of freedom in the discussion 
of characteristics. Subtracting the contribution from the gauge degree of freedom, i.e., 
considering the 3x3 matrix whose basis can be anything independentof k^, we can obtain 
the exact characteristic equation, which is satisfied only when A;^ is null. A more rigorous 
treatment of the removal of gauge degrees of freedom and the characteristics of Maxwell 
equations can be found in [58]. 

5 Characteristics Equations and Problematic Time Evolution in /(T) 



In this section, we discuss the Cauchy problem for /(T) gravity by the analysis of the 
characteristics. First, we shall determine the characteristic equations of /(T) gravity. 
After that, we will show that a constant-time hypersurface of the FLRW metric does not 
provide a good initial condition and that the time evolution of the FLRW metric is not well 
behaved. Finally, we concretely show the problematic solution where the time evolution 
from a FLRW initial condition is not unique. 

5.1 Characteristic Equation of /(T) Gravity 

From eq. (2.10), we see that the vacuum field equation in /(T) gravity is explicitly given 



Gravity 



by 



0= a^(ee/5/'^) - ee, 



■a 



'T',,S;'^]fT-lee.'f + ee, 



■a 



PSp^-{d,T)fTT. 



(5.1) 



This can be rewritten as 




(5.2) 



The highest order derivative terms are just 




(5.3) 



We recall that Sa is linear in T' 



so we may write 



f pV J,^ flU Oifirpb 

'a a b c 



(5.4) 
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for some Ma 



111/ afS 



^ which is algebraic in the tetrad e'^p. The exphcit form of Ma 
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where the bracket 



Consequently we have 
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Thus, we can re-write eq. (5.3) in a more simphfied form as 



2e 



/tM/%"^ + 2/tt5/^5, 



a/3 



dpdaG^fj- 



(5.5) 

(5.6) 
(5.7) 

(5.8) 



Following the standard procedure we can replace d^dae^ ^ — )• kykafi^ ^ in eq. (5.8) to obtain 
the characteristic equations for f[T) theory of gravity 



fTMf\"^ + 2/tt5/^S,"'^1 k.Ke'f, = 0; where M/'^, 



pv 



(5.9) 



Here the notation indicates that these are not the values of the frame, but rather 
represent the change of the frame in a certain direction. We note that eq. (5.9) thus 
provide 16 quadratic polynomial conditions on the four A;^'s. 
That is, the characteristic matrix becomes 



M 



u /3 
a b 
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kpka ■ 



(5.10) 



Since the tetrad e" always has the inverse Ca^, the property of the matrix is the 



same as that of 



2e 



h h 



(5.11) 



Therefore for convenience, we shall analyze the matrix in eq. (5.11) instead. 

Before discussing the property of the characteristic matrix of eq. (5.11), we must first 
find out the gauge degrees of freedom. We have 16 equations of motion (2.10), but not all 
of them are independent due to gauge degrees of freedom. We actually have four nontrivial 
identities. 

Indeed, let us consider the invariance of the action under an infinitesimal coordinate 
transformation:^^ 



(5.12) 



Geometrically this represents an infinitesimal diffeomporhism. 
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This leads to the transformation of the tetrad e"^ by 

= d, {-e\e) - T%,eC {-e\e) , (5.13) 

and we know this is related to a gauge mode, i.e., — 6*^0,^" = i*"^, where F"" are four scalar 
fields with an internal index. Varying the action with respect to the component 

^a^(gauge) ^ _ (5.14) 
one obtains the nontrivial Noether differential identities [68] 

a^(X/) + e/r%V = 0, (5.15) 

where Xj^ is defined in eq. (2.10). We stress that eq. (5.15) is an identity, i.e., it is 
algebraically satisfied without using the equation of motion eq. (2.10). 

Since the infinitesimal coordinate transformation also leads to a transformation of 
metric, the metric has the information about all the gauge degrees of freedom. Therefore, 
all the degrees of freedom of the gauge modes can be eliminated by gauge fixing of the 
metric components. 

Now, we move on to the discussion of the characteristic determinant which is con- 
structed by the 12 x 12 part of the characteristic matrix in eq. (5.11) having removed 
the gauge degrees of freedom. Here, we analyze the characteristic matrix TW^'"'''^ by the 
decomposition into symmetric and antisymmetric parts with respect to u and A and also 
with respect to /3 and 7. Without loss of generality we can choose an appropriate basis 
to write down the 12 x 12 matrix such that the left half 12 x 6 submatrix represents the 
symmetric part and the right half 12 x 6 submatrix represents the antisymmetric part. 

First, we consider the first term of the characteristic matrix in eq. (5.11). The first 
term of the symmetrized characteristic matrix is well behaved because it is the only con- 
tribution in TEGR. The symmetric part of the characteristic equation is related to the 
metric component, which a priori has 10 degrees of freedom. However we know that 4 of 
the degrees of freedom are gauge modes and we have only 6 physical modes. We claim 
that its rank is generically 6, and indeed after removing the gauge degrees of freedom the 
second order derivatives are associated with all components of the symmetrized tetrad. We 
consider next the antisymmetrized component with respect to v and A. The first term then 
gives zero contribution, which means that the rank of the first term is indeed 6 for general 

To summarize, the characteristic matrix of /(T) gravity, after removing the gauge 
degrees of freedom, can be written as a 12 x 12 matrix, with four 6x6 block submatrices. 
The generic rank of the upper left block is 6, corresponding to the symmetrized tetrads. 

Next, we consider the antisymmetrized components of the second term in eq. (5.11). 
The form of the second term is the product of two S'^^^k^ and its rank is one (since one of 
the eigenvalue vanishes). One can thus form a 2 x 2 matrix at the very center of the 12 x 12 
matrix, in which each element belongs to one of the four blocks, respectively. Together 
with the 6x6 block corresponding to the symmetrized tetrad, we thus get a 7 x 7 matrix. 
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Therefore we have found that the ranks of the first term and the second term of Ad^'^'^^ 
are 6 and 1, respectively. By the fact that S^'^'^^^k^ is generically nonzero, we can prove 
that the rank of matrix M.'^'^'^l^ is genericahy 7. This rests on the fact that the determinant 
of the 7x7 matrix is generically nonzero, which follows from elementary linear algebra 
(See Appendix. A). 

It is important to note that the 2x2 matrix at the center, being of rank 1, corresponds 
to only one of the extra degrees of freedom in /(T) gravity. However we know that the 
theory contains 3 extra degrees of freedom - so where are the remaining two? 

In fact the five antisymmetrized components corresponding to the tetrad e^'^^'^y {i = 
1, • • • ,5), which satisfy 

S^^'Kaxd^e'^'^^ = 0, (5.16) 

become zero. This means that in the equations for the five tetrads e^^^'^i, the second order 
derivative does not appear and that the characteristic equation should be at most a first 
order differential equation. In other words, the remaining two simply do not show up in 
the characteristic matrix! 

Thus, in order to obtain the complete characteristic we must derive the characteristic 
equation for e'^'^'^jy separately. This is what we will do next for the sake of completeness, 
although it is not a crucial part for our subsequent analysis (since it is sufficient to show 
that the one degree of freedom that does show up in the characteristic matrix, leads to 
problematic behavior). 

In order to derive the characteristic equation for e^*-'";^, we must go back to the equation 
of motion eq. (2.10), which can be written as 

d, {eSrh) - \ee:f - ee^^T'^^S^^r = 0. (5.17) 

If we consider equation of motion from the variation of the action with respect to e^'^^jy, 
we can easily show that the first term in eq. (5.17) does not appear. The first term is in 
fact obtained from a part of action 

5S D y d^x 5e\d^ {eSrh) 

= -J d^x{d^5e\)eSrfT, (5.18) 

and for de"'^, = 5e^'^^°'p it becomes zero due to eq. (5.16). The second term in eq. (5.17) is 
also zero because of antisymmetrization. Eventually, only the last term in eq. (5.17) gives 
a non-zero contribution. 

Although the last term in eq. (5.17) seems to be a higher order equation of the first 
order derivative of e^*^"jy (which, if true, will render the characteristic method unworkable), 
we can show that it is actually linear. Indeed, we can multiply it by — e"°/e for convenience. 
We then have the antisymmetrized equation with respect to a and v 

TP%s}^fT = 0, (5.19) 
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where 



1 / \ 



This gives the two branches of the equation, fT = and 



TP'^^Sp"'' = 0. (5.21) 

If we choose the first branch, i.e., /r = 0, the five equations become degenerate, and thus, 
time evolution cannot be fixed uniquely. This causes trouble regarding the well-posedness 
of the Cauchy problem. Therefore, we choose the second branch. The second order terms 
of the first derivative in eq. (5.21) can be written as 

(_yW^u\^^P + g^i-e.'^eA d^,e^A,^\. = 0. (5.22) 



If both derivative factors are variations of e^^^^i/, eq. (5.22) becomes trivial. Thus at least 
one of the derivative term is not such a variation. This means that eq. (5.21) is not second 
order but first order with respect to dp^e^'^^'^y and the characteristic equation for e^'^'^'^y can 
be shown to be 



(5.23) 

where the notation e'-^^^A is similar to e"^,, i.e., it also represents the change of the frame 
in a certain direction. Eq. (5.23) gives the characteristic matrix for e^*^"^: 

/I ' ' I 1 1 1 1 \ 

_y\^a!^/3A ^ / t g^\'^J'v\i^^ + [/^J-Al/ii/ _^ _j\ctv\\^ _^ _r^ti. ^0\v _^ tjiS^ gAXi' g^\i^ \ 

(5.24) 

Then, we have the exact characteristic matrix which is a combination of the 7x7 matrix 
_y^a,./3A the 5 X 5 matrix M"!;^J^^^y'^^ 

To summarize, the remaining two extra degrees of freedom do not show up in the 
characteristic matrix, since their corresponding characteristic equation only contains one 
derivative term. They can nevertheless be described by another 5x5 characteristic matrix 
We remark that despite the single derivative term here, this does not necessary 
mean that the dynamics of the two extra degrees of freedom is governed by first order 
equations. This may be analogous to the Dirac equation, which is first order and linear. 
However, together with its conjugate equation, one can derive a second order Klein-Gordon 



There is however a priori neither a physical principle nor a mathematical requirement that requires 
us to choose the second branch. This might be another issue of /(T) gravity. 

The cross terms of the 5 components e'^^^^ and the other 7 do not contribute to the characteristic 
matrix, since their components have only one derivative. Since the nonzero components of the 7x7 matrix 
y^/jai^/JA jjg^yg ^-^Q derivatives, the contribution of the cross terms with any smaller number of derivatives 
should not appear in the characteristic equation. 
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equation governing the dynamics of the system. Likewise, Maxweh's equations are first- 
order but combining them gives the second order wave equation. 

In the next section we wih give concrete examples to demonstrate problematic be- 
haviors that could arise in f{T) gravity due to bad characteristics. To appreciate what is 
likely to go wrong with the characteristic, let us look at the characteristic equation eq. (5.9) 
again. We see that the first term is good since it is the only term present in TEGR, and we 
know that characteristics are null in general relativity. In fact the first term gives dynam- 
ics to the 6 metrical components of e^^. The second term quadratic in the field strength 
S'^^yk^ governing one of the extra degrees of freedom however is almost certainly going 
to give a disaster if S'^^^k^'' = for timelike k^. It suggests that with some field values 
the characteristics could leak outside the metric null cone, which in turn means that there 
might be problems with tachyonic modes. This nonlinear behavior is reminiscent of the 
nonlinearity we encountered in the generalized Proca field example. 

Note that the case for TEGR is very different since the theory is locally Lorentz 
invariant, which allows us to choose a gauge that simplifies a lot of the calculation. In any 
case, TEGR corresponds to Jtt = for any value of T and there is no problem with the 
characteristics. 

5.2 Absence of Cauchy Development from a Constant-Time Hypersurface in 
FLRW Geometry 

In this section, we will show that in /(T) gravity the time direction, i.e., {d/dt)^, in 
the FLRW metric is the characteristic direction, and thus the time evolution of the FLRW 
metric is not unique even in the infinitesimal future. We analyze the characteristic equation 
in /(T) gravity which has been derived in the previous subsection. The characteristic 
determinant can be decomposed into two parts, the 7x7 matrix M""^^^ and the 5x5 matrix 
■^(aiitf)' analysis we look into the 7x7 matrix A4""^^^, which has the information 

about all of the six symmetric components and one of the antisymmetric components. The 
antisymmetric component appears only in the second term of the characteristic matrix 
in eq. (5.11), and thus for some k^^, the determinant becomes zero if the value of its 
antisymmetric component is zero. This means that k^ is one of the characteristic directions. 
To see this, it is enough to solve S^xj^^^k^^ = for k^. In the case of the FLRW metric, 
k^ = {d/dty is a solution. 

Let us see this explicitly. We consider the cotetrads 

e^,dx^ = dt, 

e^dx^" = a{t)6tdx\ (5.25) 

These cotetrads give the FLRW geometry with flat spatial section: 

ds^ = g^^dxi'dx" = -dt^ + a^{t)5ijdx'dxK (5.26) 

Given the cotetrads, we can calculate the torsion tensor T'^^ and S^^^ . They are 

T% = mi, (5.27) 
gijt ^ _a-^H6'^, (5.28) 
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while the other components aU vanish. Here H is the Hubble parameter, i.e., H = 
{da/dt)/a. Then, it is trivial that k^' = {d/dty is a solution of S^^''^^'k^ = 0. 

5.3 Concrete Example of Non-Uniqueness of Time Evolution in FLRW Space- 
time 

We have shown that the FLRW solution has a spacelike characteristic hypersurface, and 
thus we cannot properly discuss the time evolution from a constant time hypersurface even 
into the infinitesimal future. Here, we show a concrete solution where the time evolution 
from the FLRW initial condition without torsion is not unique in f{T) gravity with or 
without matter. 

We introduce a matter action 

Sra = j d^x ^gL„^. (5.29) 

Then, equation of motion (2.10) is modified as 

X/ = i^. (5.30) 

We assume that the matter action depends only on the metric but not all components of 
the tetrad. Eq. (5.30) can be rewritten as 

V = (5.31) 

Let us now consider the ansatz 

fP^dx^ = cosh e{t)dt + a{t) sinh 9{t)dx, 
e^^dx^ = sinhd{t)dt + a{t) cosli6{t)dx, 

e^^dx^" = a{t)dy, (5.32) 
e^^dx'^ = a{t)dz, 

which also gives the FLRW metric (5.26). For 9 = 0, this ansatz is equal to eq. (5.25). 
We note that this is a good choice of tetrad despite the lack of local Lorentz invariance in 
the theory. In particular the vector fields are everywhere smooth and linearly independent 
and thus provide a parallelization of FLRW spacetime. This choice of tetrad gives a 
parallelization that belongs to a different equivalence class of parallelizations than the 
usual diagonal tetrad of FLRW spacetime (See the penultimate paragraph of section 2), 
but it is nevertheless a solution to the field equation, as we will show below. Note that of 
course with the choice a{t) = 1, our analysis below is in particular applicable to Minkowski 
spacetime. 

The torsion tensor T'^jy and Sp^'^ are given by 

T\, = -T',, = amO{t), n, = -Tl, = H, T\ = -T^^^ = H5l (5.33) 
5/5 = - = ^^dte{t)5l, S/- = -5/* = H, = -S/ = H51, (5.34) 
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where the other components are zero. Here p,q G {y, z}. From this one readily finds that 



T = 6H^. (5.35) 

Substituting the above frame e'^^, metric, torsion tensor, torsion scalar and Sp^'^ into the 
equation of motion (5.31), we can find that 9{t) does not appear in eq. (5.31). This means 
that if ansatz (5.25) (i.e. 9 = 0) originally satisfies eq. (5.31), for any form of 0{t) the 
ansatz (5.32) is indeed an exact solution of /(T) gravity. 

Now, we can choose the form of the function 6{t) such that 9{t) = for t < and 
6{t) 7^ for t > 0. In this solution, on the spacelike hypersurface t = torsion suddenly 
emerges seemingly from nothingness, and the value of 9{t) ^ for t > is arbitrary. This 
means that — even in the infinitesimal future — time evolution is not uniquely determined 
by the initial data, in other words the Cauchy problem is not well-defined. ^'^^'^ 

Note that this problem does not arise in the case of TEGR due to local Lorentz 
invariance of the theory. 

6 Discussion 

To conclude, in this work we have derived the set of partial differential equations that 
govern the characteristics of f{T) theory of modified gravity, from which we see that there 
will generally be a real danger of superluminal propagation that leaks outside of the metric 
null cone unless /tt = for any value of T, which is the case of TEGR. 

We also took a closer look at the degrees of freedom in the theory. We find that f(T) 
gravity is a highly nonlinear theory in which many things could go wrong. First of all, as 
commented by Li et al. the linearized theory has a different number of degrees of freedom 
than the full theory, which has three extra degrees of freedom compared to TEGR (and 
thus GR). Such nonlinear behavior was further confirmed by careful linear perturbation on 
a flat FLRW background [26], in which none of the three extra degrees of freedom appear 
even in the second order linear perturbation. In this work we note that the characteristic 
equation of /(T) gravity, i.e., eq. (5.9) contains a highly nonlinear second term. 

^^We remind the readers that we work in ( — , +, +, +) convention. The torsion scalar is T = —6H^ if one 
works in the often used convention (+, — , — , — ) in the literature of f{T) cosmology. 

This is probably due to superluminal propagating modes that come in from infinity. One can compare 
this to a similar phenomenon in anti-de Sitter (AdS) spacetime. AdS spacetime is not globally hyperbolic 
since although its (conformal) boundary is infinitely far away as measured along any spacelike path, massless 
particles can propagate to the boundary and back in finite time. In particular this means that, outside the 
Cauchy development, the initial data does not uniquely determine the time evolution, due to information 
that can fiow in from infinity. In our case however, the Cauchy problem is ill-defined not because of a 
peculiar geometry of the spacetime, but because of physical superluminal propagating degrees of freedom. 
In fact it is much worse than the AdS case since one can still at least have Cauchy development locally in 
AdS. 

^^This example of an unpredictable solution has some similarities with an example that demonstrated 
unpredictability in another teleparallel theory, which appeared in the seminal work of Kopczyiiski [33] . 

^*Let us also remark that one could consider an initial tetrad which is like the one in this example only 
in a small region on the initial spacelike hypersurface. Such a tetrad makes an even nastier example. 
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Indeed, we see that generically it excites the six pieces of e''^, essentiaUy e^"'^!.^^ Of 
course, a more careful analysis by Li et al. showed that there are only 3 extra modes instead 
of 6. Regardless, we see that the antisymmetric modes are excited in a highly nonlinear 
fashion that is very prone to causing troubles. In fact, these extra modes are certain to have 
a very singular weak field limit, since the linearized theory has only the metric degrees of 
freedom. The number of degrees of freedom and the number of constraints thus seem to be 
likely to depend on the amplitude of the field. As the tetrad fields approach some specific 
values, some of the Poisson brackets are approaching zero, which in turn means that their 
corresponding "velocity" Lagrange multipliers tend to become unbounded, which signals 
the occurrence of instantaneous, i.e., faster-than-light propagation, much like the situation 
we have exhibited for the nonlinear Proca field (See also the similar discussion in [44].). 
Moreover these extra dynamic parts are likely to propagate outside the metric null cone 
and are also likely to allow for negative energy propagating waves. By further analyzing 
the characteristic matrix of f{T) gravity, we have shown that there is indeed a physical 
superluminal mode that could arise from one of the three extra degrees of freedom. Closely 
related to this issue, we have also demonstrated that the Cauchy problem is ill-posed in 
flat FLRW spacetime and even in Minkowski spacetime. We feel that this is a very bad 
property for any theory of gravity. 

This echoes the remark (Lesson 4) in [69] that extra degrees of freedom in modified 
gravities are prone to give rise to complications that are very hard to control. Although 
that remark was made under the context of f{R) gravity, it apparently also holds true for 
/(T) gravity. We recall that f{T) gravity was often thought of as being more well-behaved 
and easier to deal with compared to f{R) gravity, since its equation of motion is of second 
order instead of fourth order. However it now seems that this advantage comes with many 
trade-offs. For example, unlike its f{R) counterpart, /(T) gravity is not locally Lorentz 
invariant, and thus it is much more difficult to impose an ansatz for a solution a priori 
except in the simplest situations, such as for a flat spacetime. Furthermore, as we pointed 
out in this work, many problems that do not arise in f{R) theory could arise in f{T) 
gravity due to extra degrees of freedom and nonlinearity of the constraints. 

We recall that in [26], it is pointed out that the extra degrees of freedom do not appear 
at the linear perturbation level on a FLRW background, and that this behavior is similar 
to that of nonlinear massive gravity [70] in which, while nonlinear analysis shows that 
there are generically flve gravitational degrees of freedom [71], in the second order action 
on open FLRW background there are only two tensor degrees of freedom [27, 72]. Since 
the hidden degrees of freedom in fact cause nonlinear instability in the case of nonlinear 
massive gravity [73], it is a concern that f{T) theory could exhibit a similar pathology. 
Here we would like to point out another similarity between nonlinear massive gravity and 
/(T) gravity: they both probably exhibit superluminal propagation modes. The case 
for nonlinear massive gravity was studied in [74], in which the authors investigated the 
effect of the helicity-0 mode of the theory and found that energy can probably be emitted 

^® In more general theories with tetrads being the fundamental variables, not only e'"''' but also e''^ can 
be dynamical. 
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superluminously on the self-accelerating background. In the case of the ghost-free Wess- 
Zumino massive gravity [75], Deser and Waldron recently showed that it also exhibits 
superluminal propagation modes [76] . 

Here we should also make some comments on the possibility of negative energy solutions 
in /(T) gravity. In [3] it was shown, directly in terms of the teleparallel variables, that 
TEGR only permits positive energy. In general relativity, this is a well-known theorem 
proved initially by Schoen and Yau [77, 78] and subsequently by Witten [79]. In the TEGR 
formulation the argument was that one can arrange for the Hamiltonian to be dominated 
by positive terms. A key step in the argument was using the local frame gauge freedom to 
remove or at least control the non-positive terms in the TEGR Hamiltonian. But in /(T) 
gravity (and teleparallel theory in general) one does not have local frame gauge freedom. 
Therefore there is no way to kill or control some of the non-positive terms. 

In the positive energy test developed in [80], it is argued that while it is hard to prove 
that the energy for some theory is positive for every solution to the initial value constraints, 
it is not so hard to show that a negative energy solution exists. Indeed it is sufficient to 
show that the initial value constraints admit a non-trivial zero energy solution. If so the 
theory should be discarded. In principle this is a very strong test of a theory (i.e., it could 
exclude many theories which pass other tests). Unfortunately in practice it was not so easy 
to find even one bad solution explicitly, so it is generally hard work getting results beyond 
those of the linearized theory. In any case, to us it seems likely that f{T) gravity would 
be very vulnerable to having a nontrivial non-positive energy solution for most choices of 
/, although it is not expected to be very easy to show this. 

Although our work is purely classical, we would also like to make a brief comment 
regarding strong coupling problem once we consider quantizing the theory: Naively, the 
lack of extra degrees of freedom at the linear level is caused by accidental disappearance of 
certain kinetic terms at the linear level. Vanishing of kinetic terms at the linear level corre- 
sponds to the small limit of their coefficients, which in turn means the nonlinear coupling 
terms of the canonically normalized modes are becoming very strong. Consequently the 
perturbative approach cannot be applied and the theory is out of control at the quantum 
level. Then there is no reason to trust the classical effective action any more [81]. This is 
expected to be a problem if one attempts to naively quantize f{T) gravity. It is also worth 
mentioning that existence of ghosts in a classical theory of gravity may be acceptable if the 
ghost can be pushed to the Planck scale, to be dealt with by quantum theory of gravity 
[82]. 

Finally, it is worth emphasizing that due to the fact that f{T) theory is highly non- 
linear, a full rigorous analysis is at best a very difficult task. One could perhaps still have 
a viable theory if one manages to find a particular choice of / that could avoid or at least 
tame the problems raised in this work in some way. In other words, these seemingly se- 
rious "problems" could in fact be a blessing in disguise, since they may provide a guide 
for narrowing down the viable forms of /. In addition, as commented in [44], it is also 
possible that, somewhat counter-intuitively, this could be an indication that an even more 
nonlinear theory is required. After all, the linearized spin-2 theory of gravity has problems 
that are only cured by nonlinearities of the full theory of general relativity. 
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A Appendix 

Let us write M = {M)ij to denote a matrix M, while Mij denotes the matrix element at 
the i^^ row and j^^ column of M. 

Lemma 1. Let M = {M)ij be an n x n matrix with non-vanishing determinant. Let 
A = {A)ij be a 2x2 matrix with vanishing determinant. Let N = {N)ij be the (n+l) x (n+l) 
matrix constructed from M and A, such that 



(1) 


Nnn = Mnn + Oil, 




(2) 


^n.(n+l) = «12, 




(3) 


N{n+l)n = «21, 




(4) 


^(n+l)(n+l) = 022, 




(5) 


N(^n+i)i = 0, ie{l,2,-- 


■ ,?^} 


(6) 


iv,(„+i) = 0, i e {i,2,-- 




(7) 


Nij = Mij., otherwise. 





Then, detiV = 022 det M. 

Proof. The determinant of M can be calculated straightforwardly by Laplace expansion: 



/O • • • 




+ {M) 



nn 




(A.l) 



VO • • • an 



where r = (0 • • • 012)'^. 
The first term is 



022 (an + Mnn) det (M)(„_i)x(„,_i) + 



(A.2) 
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where remaining terms in the Laplace expansion. The second term is 



Vanishing of determinant of A means that 021012 = 011022- Therefore we see that eq. (A.l) 
simply gives 



□ 

It immediately follows that if 022 7^ 0, then det N ^ since det M ^ 0. This establishes 
the claim in section 5.1, that the determinant for the 7x7 matrix is generically nonzero. 
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